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Abstract 

A number of papers claim that a Log Periodic Power Law (LPPL) fit- 
ted to financial market bubbles that precede large market falls or 'crashes', 
contain parameters that are confined within certain ranges. The mecha- 
nism that has been claimed as underlying the LPPL, is based on influence 
percolation and a martingale condition. This paper examines these claims 
and the robustness of the LPPL for capturing large falls in the Hang Seng 
stock market index, over a 30-year period, including the current global 
downturn. We identify If crashes on the Hang Seng market over the pe- 
riod 1970 to 2008. The fitted LPPLs have parameter values within the 
ranges specified post hoc by Johansen and Sornette [T2] for only seven of 
these crashes. Interestingly, the LPPL fit could have predicted the sub- 
stantial fall in the Hang Seng index during the recent global downturn. 
We also find that influence percolation combined with a martingale con- 
dition holds for only half of the pre-crash bubbles previously reported. 
Overall, the mechanism posited as underlying the LPPL does not do so, 
and the data used to support the fit of the LPPL to bubbles does so only 
partially. 



1 Introduction 

Two broad finance theories make predictions about stock prices. They are 
the efficient market hypothesis (EMH) and the rational bubbles view (RBV). 

*We acknowledge the great help provided by Anders Johansen particularly in answering 
the many queries that we had about his data analysis and in providing us with the program 
that he used to estimate the parameters of the LPPL. 
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Both theories begin from the standpoint that an asset has a fundamental value, 
defined as the market's expected discounted present value of the future cash 
flows associated with the asset. Empirical tests of both the EMH and the RBV 
often fail to explain large market price falls or 'crashes', as such crashes are 
not usually associated with any specific news itemQ For example, Cutler et al. 
[H] find that of the 50 largest daily price falls in aggregate stock prices for the 
period 1946-1987, the majority are not accompanied by external news of specific 
importance^] 

Empirical tests of the RBV have also had limited success in identifying price 
bubbles prior to large price falls (see, e.g., [2] and [27]). ln particular, Donaldson 
and Kamstra [7J estimate a non-linear ARMA-ARCH artificial neural network 
model which allows them to reject the claim that the 1929 stock market crash 
was the outcome of a bubble. One reason for the failure of tests of the RBV 
is the difficulty of explicitly isolating an asset's fundamental value from the 
component of the bubble tied to the asset's market price. 

An alternative approach to modeling stock market crashes and their bubbles 
is to fit a Log-Periodic Power-Law (LPPL) to asset prices]^] The notion that 
statistical description of financial crashes as manifestations of power law accel- 
erations essentially suggests that stock market crashes obey a particular power 
law with log-periodic fluctuations. Sornette et al. [25], Sornette and Johansen 
[25] and Lillo and Mantegna [T7] show that this model is able to capture a shift 
over time in the log-periodic oscillations of financial prices that are associated 
with market crashes. 

The analogy of financial crashes as being similar in their statistical signatures 
to critical points as depicted in natural phenomena has, however, been argued 
to be unrealistic. Laloux et al. [T5J express doubts about the validity of a 
seven-parameter model fitted to highly noisy data. They argue that such a 
model would suffer from severe over-fitting. Also, some log-periodic precursors 
do not always lead to crashes but to a smooth draw-down or to an even greater 
draw-up. This suggests that there is no universality in the manner financial 
bubbles are manifested. Indeed, some evidence (see, e.g., [S]) shows that the 
predicted time of a crash is sensitive to the size of the event-window used to 
predict the crash. 

Whilst the LPPL model is not perfect, it is empirically appealing as it pro- 
vides a forecast of the date by which a financial crash will occur. This is an 
important attribute relative to other methods of financial risk assessment. For 
example, Novak and Beirlant [20, p. 461] argue Extreme Value Theory provides 

Our definition of a stock market crash is similar to that of Hong and Stein \9. , in that 
they represent unusual large market falls that are not followed by large public news events 
and such falls are market wide in nature. 

2 Recently, several specific theoretical models of stock market crashes have been put for- 
ward. In Romer's [21] symmetric rational asset-price model, neither rational behavior nor 
external news plays an important role in stock market crashes. Also both the Hong and Stein 
[3] and Barley and Veronesi £Q models assume that economically significant differences in the 
views of investors can lead to stock market crashes when they are revealed. 

3 Kumar et al. [13] also apply logit models to microeconomic and financial data and show 
that currency crashes can be predicted. 
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a means of predicting ". . . the magnitude of a market crash but not the day of 
the event." As such, the LPPL model appears to contain important statistical 
attributes that require serious empirical consideration. For example, the LPPL 
model contains a component that captures the market's excessive volatility prior 
to a crash. This feature is consistent with several theoretical models of financial 
crashes as well as with empirical results [THl [5] . 

There are several critical considerations associated with fitting an LPPL to 
financial data and we take issue with some of them as follows: first, studies 
that support the LPPL (see e.g., [IT]) show that the parameter estimates for 
this model are confined within certain ranges and that it is these that are the 
indicators of market crashes. This view considerably restricts the number of 
classes of permissible models that fit bubbles preceding stock market crashes to 
just those LPPLs whose parameters fall within the specified ranges rather than 
to any 7 parameter fitted LPPL. Second, the mechanism underlying the LPPL 
is such that prices must increase throughout the bubble, which is largely in line 
with the rational bubbles literature, but which is not what has been found in 
empirical fits of the LPPL (see Section 3.5). Finally, we do not feel that there 
has been sufficient critical analysis of the LPPL and its goodness-of-fit to market 
bubbles. In particular, a goodness-of-fit test is rarely applied and the sensitivity 
of the parameters of the fitted LPPL is usually not reported (see Section |5.5[ ) Q 

The remaining main sections of this paper are as follows: Section[2]introduces 
the LPPL; Section [3] describes the mechanism underlying the LPPL and tests 
it using already published results; Section [4] gives some details of the procedure 
used for finding the parameters of an LPPL so that it best fits the data; Section[5] 
fits the LPPL to pre-crash bubbles on the Hang Seng index, checks on the fits 
already published and tests whether the parameters of the fitted LPPLs do in 
fact predict crashes. 



2 The LPPL 

The simplest form of the LPPL can be written as: 

y t = A + B{t c ~tf {l + Ccos{u]\og{t c -t)+dp)}, (1) 

where: 

4 Laloux et al. 1151 p. 4] report two instances when financial crashes were predicted ex 
ante. The prediction was correct in one case but not in the other despite both predictions 
being published prior to the expected crash date. Indeed, they conclude that ". . . recent claims 
on the predictability of crashes are at this point not trust worthy" . 



3 



m > 


is the price (index), or the log of the price, fit time t] 


A > 


is the value that yf would have if the bubble were to last until 




the critical time t c \ 


B < 


is the increase in y t over the time unit before the crash, 




if C were to be close to zero; 


C 


is the magnitude of the fluctuations around the exponential 




growth, as a proportion; 


t c > o 


is the critical time; 


* < t c 


is any time into the bubble, preceding t c ; 


= 0.33 ±0.18 


is the exponent of the power law growth; 


io = 6.36 ± 1.56 


is the frequency of the fluctuations during the bubble; 


< </> < 2tt 


is a shift parameter. 



The ranges of values given for both and u> are based on the observed param- 
eters of crashes for many stock markets |10) . These ranges for and u>, rather 
than any goodness-of-fit test, are used to identify the bubbles that precede 
crashes. 

Empirical studies that fit the LPPL to financial data make a number of 
claims: 

1. The mechanism that characterizes traders on financial markets is one in 
which they mutually influence each other within local neighborhoods. This 
leads, in turn to co-ordinated behavior through a martingale condition, 
which in the extreme can lead to a bubble and then a crash (see e.g., 

EH). 

2. Financial crashes are preceded by bubbles with fluctuations. Both the 
bubble and the crash can be captured by the LPPL when specific bounds 
are imposed on the critical parameters and w (see e.g., [HURT!?] ). 

3. The established parameters are sufficient to distinguish between LPPL fits 
that precede a crash from those that do not (see e.g., [21] )0 

In this paper, we examine the first two of the above claims and suggest a new 
approach for testing them. The third claim is more controversial; it only makes 
sense to evaluate it once we have a positive evaluation of the second claim. 

3 Is the Underlying Mechanism Correct? 
3.1 The underlying mechanism 

The mechanism driving the change in price during a bubble as posited in Jo- 
hanscn et al. |11| is based on rational expectations, namely, that the expected 
price rise must compensate for the expected risk. The mechanism is a stochas- 
tic process such that the conditional expected value of the asset at time t + 1, 

5 Recently Lin et al. |18[ carried out such an evaluation on a variant of the LPPL model. 
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given all previous data before and up to t, is equal to its price at time t. The 
martingale condition is formulated by Johansen et al. [11] as: 



dp <- Kp(t)h{t)dt, (2) 

where: dp is the expected change in price, conditional on no crash occurring 
over the next time interval dt, at equilibrium; 
p(t) is the price at time t; 

k is the proportion by which the price is expected to drop during 
a crash, if it were to occur; 

h(t) is the hazard rate at time t, i.e. the chance the crash will occur 
in the next unit of time, given that it has not occurred already. 
Under this martingale condition, investors will buy shares at time t if their 
expectation is that the price at time t+X will exceed the price at t by more than 
the associated risk; that is: E(p(t + X)) > p(t)+dp. This buying would drive up 
today's price. So the expected rise in price between today and tomorrow will be 
less (assuming that the expected price tomorrow remains constant); this buying 
will continue until the expected rise is in line with the perceived risk according 
to Eq. [2j And, vice versa, if investors believe that the expected rise in price 
tomorrow will be insufficient to compensate for the risk, i.e. E(p(t + 1)) < 
pit) + dp, then they will try to sell today, going short if necessary, thus driving 
today's price down. 

Note that all the terms on the right side of Eq. [2] are positive, so dp > 
0, i.e., the price must always be expected to be increasing during a bubble. 
This condition was not treated as a constraint in early work (see, e.g., 
and as such gives us the opportunity of treating this requirement as a testable 
prediction^] 

We now follow the consequences of Eq. [2] for the behavior of prices. Re- 
arranging Eq. [2] gives us: 

dp = nh(t)dt, 



Pit) 



log pit) = k / hit')dt'. (3) 



to 

To capture the behavior of the price, the hazard rate, h(t), needs to be 
specified. Here, Johansen et al. [11] posit a model in which each trader i is 
in one of two states, either bull (+1) or bear (-1). At the next time step, the 
position of trader i is given by: 

sign \K ^2 s j + ae i ) > ( 4 ) 

jeN(i) 



6 Sornette and Zhou (26] ■ a later work, does treat this condition as a constraint on the 
permissible parameter values. 
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where: K is the coupling strength between traders; 

N(i) is the set of traders who influence trader i; 
Sj is the current state of trader j; 

a is the tendency towards idiosyncratic behavior for all traders; 
ej is a random draw from a zero mean unit variance normal 
distribution. 

The relevant parameter determining the behavior of a collection of such traders 
is the ratio K/a, which determines a critical value of K, say K c . If K <C K c 
then the collection is in a disordered state. However, as K approaches K c order 
begins to appear in the collection, with a majority of traders having the same 
state. As the value of K approaches K c from below, the system becomes more 
sensitive to small initial perturbations. At the critical value, K c , all the traders 
will have the same state, either +1 or -1. Johansen et al. [TT] further assume 
that: i) the coupling strength K increases smoothly over time up to K c ; and ii) 
the hazard rate is proportional to K. They do not justify these assumptions but 
the first one might be based on assuming that, as the frequency of fluctuations 
increases, traders become less sure of their own judgment and rely more on the 
judgment of their neighbors. In the next sections, we consider the evolution of 
K over time. 



3.2 Simple power law hazard rate 

In the simplest scenario K evolves linearly with time. Assuming that each 
trader has four neighbors arranged in a regular two dimensional grid, then the 
susceptibility of the system near the critical value, K Cl can be shown to be given 
by the approximation: 

X « B"(K C - K)~\ (5) 

where B" > and < 7 < 1 (see [TT]). The three assumptions taken together 
give: 

h(t) w B'(t c - i)~ Q , (6) 

where < a < 1. Substituting in Eq. [3] for h as given by Eq. [6] and integrating 
gives: 



logp(t) = k [ B'(t c -t')- a dt' = -^[{t c -tf 

Jt 1 a 



— a I " 
to 



,:U ' ((*c-*) 1 - a -(tc-to) 1 - a ) 



1 — a 
—kB' 

At * - t c , logp(t c ) = (0 - (t c - to) 1 -") . 

1 — a v ' 

kB' 

So logp(i) = Iogp(i c ) + (t c - tf~ a 

1 — a 

= A + B(t c -tf, (7) 

where: A = \ogp(t c ), B = kB'/(1 — a) and j3 = 1 — a. This is a simple 
exponential growth. 



6 



3.3 Log periodic hazard rate 



To introduce log periodic fluctuations into the exponential growth function we 
need a different form of interconnected structure. Such a structure is assumed 
to be equivalent to one created by: i) starting with a pair of linked traders; ii) 
replacing each link in the current network by a diamond with four links and two 
new nodes diagonally opposite each other. This process continues until some 
stopping criterion is met. Then (see 



X « B"{K C - K)-~* + C"(K C - K)-~> cos(uj\og(K c - K ) + <f>') 
$oh{t) « B'(t c -t)- a [l + C'cos(uj\og(t c -t) + (/)')}, from Eq. [61 

Substituting for h in Eq. [3] from Eq. [8] gives: 



(8) 



logp(t) = K B'{t c -t')- a {l + C cos{i J j\og{t c ~t') + (t>')}dt'. (9) 

J t 

Substituting (3 — 1 — a and tp(t') — wlog(t c — t') + <j>' in the integral 



[ (t c - t')- a cos(u; log(i c -t') + 4>')dt' 

J t n 



{tc-t'f ^ COS ^(t')dt 



(10) 



'o 



u 2 + (3 2 



{u)smip{t') + /3cosip(t')} 



Integrating Eq. fusing Eq. [T0| gives: 
logp(i) = 

± — ( 

{{tc-tf -{t c -t Y} 



—B' 

1 - a 

-B' 



, u _ a _ B'C>%-Hf 



{t c - t'f- a - - - ^ ; ' (wffln^) + /3cos^(f)) 



{(i c - t)' 3 (wsinV'(t) +/3cos-0W) 



logp(i c ) = K 



uj 2 +/3 2 

-(* c - to)' 9 (wsinV^o) +/?cos^(to))}] , 
5'(t c -<o) /3 , B'C'% - to)' 3 



logp(i) = logp(i c ) - 



(w sin ■(/'(to) + /?cost/>(£ )) , 
{ (t c - t) 13 (u) sin ip(t) + /3 cos ip(t))} 



P GJ 2 +(3 2 

nB'{t c -tf kB'C 
f3 ~ ~ lu 2 + (3 2 
= A + B{t c - tf[l + C"(ujsmTP{t) +Pcos<if)(t))} 
« A + B(t c - t) f3 [l + Ccos(tp(t) + 4>')} 
« A + B(t c - tf[l + C cos(w log(t c - i) + ^)], 

where A = logp(t c ), B = -kB'//3, and C" = /3C" /{u 2 + fi 2 ), which is the 
LPPL of Eq. [jjwith y t = log(p t ). 



(11) 
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3.4 Index: raw versus log 



From Eq. (11), it is the log of the price index that needs to be fitted to the 
LPPL although the LPPL is often fitted to the raw index data. Johansen and 
Sornette [T2] justify the use of the raw data by assuming that the price drop 
in the crash is proportional to the price over and above the fundamental value 
rather than being proportional simply to the price. That is, they replace the 
condition [2] by: 



dp <— n(p(t) — pi)h(t)dt, 



(12) 



where p\ is the fundamental value, which they do not further define. 

Johansen and Sornette [Hj introduce the assumption that the rise in price 
since the beginning of the bubble is much less than the amount by which the 
price at the beginning of the bubble is above the fundamental value. Thus 



p{t) -p(*o) <p(*o) -Pli 



(13) 



where to is the time of the beginning of the bubble. Even if the asset's fun- 
damental value is not estimated in the model, the above assumption is weakly 
testable. If the price rise during the bubble is greater than the price at the 
beginning of the bubble, i.e. p(t) > 2p(t ), then the condition of Eq. 13 cannot 



be fulfilled unless the fundamental price is negative. We assume that this is not 
what is intended. So we can test whether or not this assumption is met. 



Integrating Eq. 12 from the moment when the bubble starts, to, and using 
Eq. [13] gives: 



p(t) = p{t )+ / dp 



Pih) + K / (p(i') - Pi)h(t')dt' 



p(*o) + «(p(*o)-Pi) / h{t')dt' 

J to 



(14) 



Provided the assumption in Eq. [13] is met, Eq. [l4]can be used to fit the LPPL 
to raw price (as done, e.g., in [12]) rather than the log price data. 



3.5 Tests of the underlying mechanism 

Chang and Feigenbaum [3] tested the mechanism underlying the LPPL using 
S&P index data for the bubble preceding the 1987 crash. They tested whether 
the observed price changes could be fitted to a LPPL and whether price changes 
could be predicted by a random walk model. To do so they first extended the 
LPPL as given in Eq. [T]by adding: 

• a random term with zero mean and variance estimated from the data. This 
noise term is necessary to compute a likelihood for the observed data devia- 
tions from the predicted LPPL. 
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• a positive upward drift term estimated from the data. This addition to the 
LPPL, while frequently made in financial time series, is unnecessary here as 
exponential growth is posited in the LPPL. 

Then they estimated the likelihood of the observed change in price since the 
previous day t — 1, and selected parameters that maximized the sum of these 
likelihoods over the entire bubble. 

With a time series there is a choice of which next point to take as being 
the most likely: either the predicted value or the predicted change since t — 1. 
Using the model's prediction of the value at t ignores the value at t — 1; this 
is what Johansen et al 11 implicitly assume when they minimize the RMSE 
for the fitted LPPL against the data. On the other hand, using the predicted 
change since t — 1 ignores any deviation that the price at t — 1 already has 
from the model's prediction for t — 1. This is what Chang and Feigenbaum jl] 
explicitly do. So they have established simply that the mechanism underlying 
their adaptation of the LPPL, when judged for each time point separately, is 
not to be preferred to the random walk model, which is hardly surprising given 
what is known about the random nature of stock market prices. 

While most of the assumptions underlying the mechanism from which the 
LPPL is derived are untestable (or even questionable), there is one that is 
testable: the hazard rate h must be positive. This implies that the price must 
always rise. If the fitted LPPL does not have this property, then the assumption 
that h(t) in Eq. [2] is a probability, must be rejected. Figure [I] shows the LPPL 
fitted to the raw Hang Seng index data for the bubble preceding the 1989 crash. 
The fit is similar to Figure 8 of Sornette and Johansen [23]. The LPPL in 
Figure 1 has a negative slope some of the time. The same is true in 16 of the 
30 cases reported by Sornette and Johansen [T2], [24] Q That is, the fitted LPPL 
predicts that the price decreases at some time points. Unless we are mistaken, 
this empirical fact is sufficient to reject the martingale condition as being the 
mechanism underlying the LPPL fit to pre-crash bubbles. However, this does 
not affect the ability or otherwise of the LPPL to fit the data. 



4 Fitting the LPPL Parameters 

The seven parameters of the LPPL in Eq. [I] have to be estimated from the 
window of data points in the bubble. The chosen values of these parameters 
should be the ones that minimize the root mean squared error (RMSE) between 
the data and the LPPL prediction for each day of the bubble. The squared error 
between the prediction from the fitted curve from Eq. [T] and the data is: 

7 These 16 pre-crash bubbles are: the Dow Jones (1929, '62), S & P ('37, '87), Hang Seng 
('89, '94), Argentina ('91, '92, '97) and various stock market crashes of 1994 (Indonesia, Korea, 
Malaysia, Philippines) and 1997 (Indonesia, Mexico, Peru). 
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t„ t„ 

SE = J2(y t -y t f = J2{y t -A- B(t c - tf (1 + CcOs( W log(t c - t) + <j>)f , 
t=t ± t=t x 

(15) 

where: yt is the data point, either the price index or its log; 

y t is the data point as predicted by the model; 

n is the number of weekdays in the bubble; 

ti is the calendar day date of the i th weekday 
from the beginning of the bubble. 
Partially differentiating Eq. [l5]with respect to the parameters A, B and C gives 
us three linear equations from which the values of A, B and C that minimize 
the RMSE are derived, given the other four parameters: /3,aj,i c and <j). To 
find suitable values for these four parameters a search method is required. The 
method used in [12] and [24], hereafter collectively called the JS studies, was: 

• First to make a grid of points for the parameters u> and t c , from each of which 
a Taboo search was conducted to find the best value of f3 and <j>, i.e. the ones 
for which, with A, B and C chosen to minimize the RMSE, gave the lowest 
RMSE. 

• To select from these points those for which < (5 < 1. 
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• From these points to conduct a Nelder-Mead Simplex search [T!5] , with all the 
four search parameters free (and A, B and C chosen to minimize the RMSE). 



We presume that the reason that any fit with /3 > 1 was rejected is because 
then the increase in the index is exponentially declining whereas the underlying 
mechanism requires it to be increasing. An alternative technique would have 
been to place no restriction on the value of (3, and if a value of (3 > 1 is found, 
to reject the model, as we have done for the requirement that the fitted LPPL 



never decreases (see Section 3.5 ) 



Similar to the JS studies, we use a preliminary search procedure based on a 
grid to provide seeds for the Nelder-Mead Simplex method, as implemented in 
Matlab Q3]. It is based on choosing different values for the two parameters ui 
and P, as these are the critical parameters for determining whether the fitted 
LPPL is a crash precursor or not (see Eq. [I]) . The algorithm and the parameter 
values used are shown in the Appendix. Note that instead of the crash date, t c , 
we use t2c, the number of days between the day on which the estimate is being 
made and the predicted critical date. 

5 Empirical Results 

5.1 Data and descriptive statistics 



Table 1: Descriptive statistics for changes in the log of the Hang Seng stock 
index. 





N 


Mean 


Variance 


Skewness 


Kurtosis 


J-B 


HS Index returns 


10152 


0.00045" 


0.00035 


-1.25934" 


31.58011" 


424542.78.9" 



The mean and variance arc multiplied by 100. 
J-B is the Jarquc-Bera statistic for testing the null hypothesis of normality, 
"and b denote statistical significance at a 1- and 5- percent level, respectively. 



To test the LPPL, daily observations for the Hang Seng were downloaded 
from Data-stream. The data spans the period 1 st January 1970 to 31 st De- 
cember 2008. We analyze the Hang Seng index since it is commonly believed 
that this market has had several crashes, thus giving us ample opportunity to 
test the LPPL j^| Descriptive statistics, shown in Table [l] reveal that the mean 
log changes of the Hang Seng index series are significantly different from zero. 
Both skewness and (excess) kurtosis are significant such that the Jarque-Bera 
test rejects the null of normality at a 1 percent level. Notice that skewness is 
highly significant and negative. This finding suggests that the Hang Seng stock 

8 This suggests that stock market crashes can be common. Indeed, using a statistical 
method to identify outliers, Schluter and Trede 1221 show that the 1987 stock market crash of 
the Dow Jones Industrial index was not a structurally unusual event. 
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HS index and identified crashes using: peak. since, drop. to and drop. by (see Legend) 
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Figure 2: The Hang Seng index 1970 to 2008, showing those peaks that are 
initiators of crashes. 

market can be very sensitive to stock market crashes. That is, volatility feed- 
back can increase the probability of large negative returns and in turn, increase 
the potential for crashes [3]. 

5.2 Identifying a crash 

As indicated earlier, we use a definition of stock market crashes similar to that 
of Hong and Stein [S]. To test whether or not the LPPL can predict crashes 
we first need to identify the crash itself. Usually a stock market crash is taken 
to mean a very large and unusual price fall. In our application, a crash can 
span more than one day. This is consistent with the October 1987 stock market 
crash. 

There are two situations when we might falsely claim that a crash has oc- 
curred. One is when the index is on the way up in a bubble and then there is a 
large drop, but it turns out that the drop is temporary and the bubble contin- 
ues. The other is when, on the way down during a crash, the index experiences 
a recovery and so we identify the beginning of a new bubble but the recovery 
is temporary and the anti bubble is still in effect. To avoid those situations, we 
identify a peak as one initiating a crash as follows: 



• A period of 262 weekdays prior to the peak for which there is no value higher 
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Drops on the HS, by weekdays to drop; for each crash 
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Figure 3: Drops from peaks on Hang Seng index 1970 to 2008. 



than the peak's. 

• A drop in price of 25%, i.e. down to 0.75 of the peak price, which is in line 
with the 1987 crash. 

• A period of 60 weekdays within which the drop in price needs to occur. 

We first want to determine whether the application of these criteria enables 
us to capture the eight crashes on the Hang Seng index, as identified in the 
JS studies. Indeed, we identify crashes at the same time points as in the JS 
studies, except for one additional crash in 1981 (see Figure [2]). To exclude the 
price fall in 1981 from being classified as a crash, we would have to increase 
the drop-to criterion or reduce the drop-by criterion. Doing either would also 
exclude some of the other peaks as initiating crashes, viz. those peaks that 
immediately preceded the crashes of 1978, 1994, 1997, all of which are identified 
as crash initiators in the JS studies (see Figure [3]) . Thus the rule they apply 
seems somewhat imprecise. It is true that the 1981 crash occurs shortly after 
the 1980 crash, so we might exclude the 1980 peak as initiating a crash, but 
rather being a part of the bubble preceding the 1981 crash, but this is not what 
was done in Sornette and Johansen [53]. It would also be possible to exclude 
fitting an LPPL to the bubble preceding the 1981 crash on the grounds that 
this bubble is too short - just 7 months long. However, another bubble (the one 
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preceding the crash 1971) was fitted even though it lasted only 6 months. As 
such, the bubble preceding the 1981 crash should have been included in the JS 
studies, unless one insists on having more than say 7 months of data preceding a 
crash. On balance, we believe that it is appropriate to include the 1981 crash we 
have identified, giving us nine crashes for the period of the JS studies. Overall, 
the criteria for identifying a crash does not appear to be consistently applied in 
the JS studies. 

In the period after the JS studies, i.e. between 2000 and 2008, our criteria 
identify two additional peaks as initiating crashes; these are in 2000 and in 
2007. The two bubbles preceding these crashes provide a post-hoc test of the 
hypothesis underlying the LPPL Eq. [T] 

5.3 Troughs and bubble beginnings 

Having decided that a peak is the initiator of a crash, the data window to be 
used for fitting the LPPL to the preceding bubble needs to be carefully selected. 
In the JS studies the start of the data window is taken to be the day on which 
the index reaches its lowest value "prior to the change in trend" [T^]. In real 
time matters are not so simple, since one does not know if the index will drop 
still further in the future. So for real time analysis we would need to take as the 
end of the previous crash the lowest point since the last crash, up until now. 

Moreover, Johansen and Sornette |12j sometimes move the beginning of the 
bubble from the lowest point since the previous crash to a later time as in their 
Asian and Latin-American study. This was done if "at the trough the next 
bubble had not yet begun" (Johansen, personal communication). From the JS 
studies, we deduce that this was done for four of the eight crashes they identified 
on the Hang Seng: 

• 1971 crash: forward 2 months, from 5/1/1971 to 10/3/1971; 

• 1978 crash: forward 3 years and 1 month, from 10/12/1974 to 13/1/1978; 

• 1987 crash: forward 1 year and 8 months, from 2/12/1982 to 23/7/1984; 

• 1994 crash: forward 2 years and 2 months, from 5/6/1989 to 19/8/1991. 

These are indicated by squares in Figure [4] 

It is clear why Johansen and Sornette [H] moved the beginning of the bubbles 
for the 1978 and 1987 crashes to times later than the trough proceeding the 
crash. For 1978 there was a long period of stable prices which is clearly not 
part of a bubble. For 1987 the year and 8 months following the trough are 
characterized by two mini bubbles and two peaks (which with other crash criteria 
would themselves be considered initiators of crashes). It is not so clear why they 
moved the start points of the other two bubbles (preceding the 1971 and 1994 
crashes) forward. 

In the JS studies, a model fit is only made if there are at least 131 weekdays 
of data between the trough and the crash. Changing the number of days could 
lead to different bubbles being considered as crash precursors. To illustrate this 
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HS index and identified crashes using: peak.since, drop.to and drop.by (see Legend) 
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Figure 4: Troughs and other beginnings of bubbles on Hang Seng 1970 to 2008. 
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for the Hang Seng data, there are only 155 weekdays between the end of the 
1980 crash and the peak in 1981 when it appears that another crash occurred. 
To require (say) 262 weekdays would result in insufficient data, and thus exclude 
the bubbles before both the 1981 and the 1971 crashes, thus affecting the results. 
This means that one needs to be very careful in implementing the rule, given 
the data under consideration. 

5.4 Fitting to the raw index 



Table 2: Ratio of raw Hang Seng index on the last day to index at the beginning 
of the bubble. 





Bubble 






Raw Hang 


; Seng: 


Ratio: 


beginning 


at to 


ending 


on t e 


p(*o) 


P(*e) 


p(f e )M*o) 


*10-Mar- 


1971 


20-Sep- 


1971 


201 


406 


2.02! 


22-Nov- 


1971 


09-Mar- 


1973 


279 


1775 


6.36! 


*13-Jan- 


1978 


04-Sep- 


1978 


383 


707 


1.85 


20-Nov- 


1978 


13-Nov- 


1980 


468 


1655 


3.54! 


12-Dec- 


1980 


17-Jul- 


1981 


1222 


1810 


1.48 


*23-Jul- 


1984 


01-Oct- 


1987 


747 


3950 


5.29! 


07-Dec- 


1987 


15-May- 


1989 


1895 


3310 


1.75 


*19-Aug- 


1991 


04-Jan- 


1994 


3723 


12201 


3.28! 


23-Jan- 


1995 


07-Aug- 


1997 


6968 


16673 


2.39! 


13-Aug- 


1998 


28-Mar- 


2000 


6660 


18302 


2.75! 


23-Apr- 


2003 


30-Oct- 


2007 


8520 


31638 


3.71! 



to the day the bubble began. 
t e the last day of the bubble. 

* Bubble beginning moved to a time later than the trough between peaks. 



! The ratio p(t e )/p(to) > 2, so the raw index should not be used (see Eq. 13 1 



In the JS studies, for all but the 1973 crash, the LPPL has been fitted to 
the bubble in the raw index rather than to the log of the index. For this to be 
justified, the inequality in Eq. |13| must hold. That is, the price rise during the 
bubble must be considerably less than the difference between the price at the 
beginning of the bubble and the fundamental price. If we make the reasonable 
assumption that the fundamental price cannot be negative, then at any time 
during the bubble the price must at the very least not be more than double that 
at the beginning of the bubble. If we compare the price at the time of fitting, 
prior to the crash, to that at the beginning of the bubble, then we see in Table 
[2] that this condition is met for only two of the eight bubbles found in the JS 
studies. For the remaining six bubbles this condition does not hold, i.e. the 



price more than doubled during the bubble, so the inequality in Eq. 13 which 
is the assumption upon which the raw rather than the log of the index can be 
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chosen, was violated. Despite this, in the JS studies five of these six fits of the 
LPPL are made to the raw index rather than to its log; they should not have 
been. 



5.5 Sensitivity to search parameter values 

Identifying an LPPL fit to a bubble as one that precedes a crash depends on 
the two critical parameters (3 and cj; so it is important to examine how sensitive 
the RMSE of the fit is to variations in these parameters. We use the bubble 
preceding the 1989 crash on the Hang Seng to examine the sensitivity of the 
RMSE of the LPPL fit to variations in each of the four search parameters ui, tj> 
and t c ); the other three parameters (A,B and C) are always set using these 
four (see Section [4]). The results are shown in Figure [5] The circle indicates the 
chosen parameter value. While the chosen values of the search parameters are 
at global minima, the RMSE is highly sensitive to small fluctuations in the value 
chosen for u. The sensitivity diagrams for the other Hang Seng bubbles listed in 
Tableware similar to those shown in Figure [5j In general, the search procedure 
can easily get trapped in a local minimum for the u> parameter. Consequently 
the value found by the search procedure for lj may not be the one that leads to 
the minimum RMSE. As the value of lo is used in predicting whether or not the 
bubble will be followed by a crash, this is a serious problem. 



5.6 The 'best' fits of the LPPL 

We now fit the LPPL to the raw data for each of the bubbles preceding the 
11 crashes identified for the Heng Seng index over the period 1970 to 2008 
(as selected by the criteria in Section 5.2 1, using the minimum RMSE as the 
criterion for best fit. For each crash: 
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Figure 5: Sensitivity of the RMSE to the parameters of the LPPL for 1989 Hang 
Seng crash. 
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Table 3: The bubbles and crashes of the Hang Seng index and LPPL fits to the 
raw bubble data. 







A 


B 


C 


P 




t2c 




RMSE 






HSI 


HSI 






rads 


days 


rads 


HSI 


Bubble: 


Ref: 


low: 






0.15 


4.80 


1 







from/to 




high: 







0.51 


7.92 


? 


7T 




*10-Mar-1971 


SJ 


594 


-132 


-0.033 


0.20 


4.30 


7 


0.50 


7.58 


20-Sep-1971 




539 


-101 


-0.047 


0.22 


4.30 


3 


0.25 


6.11 


22-Nov-1971 


SJ 


11 


-3 


0.003 


0.11 


8.70 


2 


0.05 


0.0722 


09-Mar-1973 


log 


65 


-56 


-0.001 


0.01 


11.1 


20 


1.32 


0.0538 




log 


8 


-0 


-0.177 


0.57 


1.47 


2 


3.14 


0.0549 




raw 


2443 


-485 


-0.114 


0.26 


1.45 


2 


3.14 


40.91 


* 13- Jan- 1978 


SJ 


816 


-50 


-0.053 


0.40 


5.90 


6 


0.17 


10.09 


04-Sep-1978 




741 


-23 


0.072 


0.51 


5.30 


1 


0.00 


10.12 


20-Nov-1978 


SJ 


1998 


-231 


-0.044 


0.29 


7.24 


3 


1.80 


46.72 


13-Nov-1980 




41164 


-38080 


0.001 


0.01 


7.51 


52 


3.06 


35.02 






7929 


-5352 


0.008 


0.05 


6.79 


26 


1.55 


35.55 






1998 


-231 


-0.044 


0.29 


7.24 


3 


2.63 


37.00 


12-Dec-1980 




















17-Jul-1981 




1753 


-0 


-0.890 


2.41 


3.02 


1 


3.14 


40.46 






1817 


-3 


-0.567 


1 


4.75 


12 


0.35 


49.24 






1946 


-11 


-0.399 


0.76 


5.89 


36 


0.00 


54.95 


*23-Jul-1984 


JS 


5262 


-542 


-0.007 


0.29 


5.60 


22 


1.60 


133.86 


Ol-Oct-1987 




5779 


-711 


0.048 


0.27 


5.68 


34 


2.63 


68.47 


07-Dec-1987 


SJ 


3403 


-32 


-0.023 


0.57 


4.90 


34 


0.50 


133.21 


15-May-1989 




3575 


-53 


-0.195 


0.52 


4.95 


31 


1.74 


76.33 


*19-Aug-1991 


JS 


21421 


-7614 


0.024 


0.12 


6.30 


4 


0.60 


322.80 


04-Jan-1994 




212635 


-194575 


-0.002 


0.27 


5.95 


1 


3.13 


272.82 






14038 


-1717 


-0.028 


0.26 


6.43 


4 


3.14 


281.36 


23- Jan- 1995 


JS 


20359 


-1149 


-0.019 


0.34 


7.50 


51 


0.80 


531.79 


07-Aug-1997 




20255 


-1201 


-0.048 


0.33 


7.47 


51 


2.29 


438.79 


13- Aug- 1998 




















28-Mar-2000 




21918 


-16 


0.073 


1.00 


18.35 


290 


0.00 


710.99 






24095 


-97 


-0.057 


0.76 


17.51 


264 


3.14 


720.17 






19503 


-372 


0.111 


0.52 


5.7 


9 


2.07 


744.15 


23-Apr-2003 




















30-Oct-20O7 




38940 


-6408 


0.019 


0.20 


5.41 


1 


3.14 


693.61 



HSI is the units of the Hang Seng Index. 

Rcf : JS denotes Johansen and Sornette [12]; SJ denotes Sornette and Johansen [24j . 

* Bubble beginning moved to a time later than the trough between peaks. 

/3 was constrained to be > 0.01, so a (3 = 0.01 indicates that the optimal value of 
P < 0.01. 

t2c number of days from date of the fit until predicted crash date, i.e. t2c = t c - 
today. 

Bold highlights those values of /3 and uj that fall well outside the range specified in 
Eq. [TJ shown in the top two rows of the table. 



• The first line of Table [3] shows the parameters of the LPPL fit as given in 
the JS studies, but with the linear parameters A,B and C recalculated for 
time expressed in days rather than years. As the RMSE was not reported for 
the JS studies (except for the LPPL fitted to the bubble preceding the 1997 
crash) this too has been recalculated by us. 

• The second line shows the parameters for our best fit to the raw data. The 
results are based on the raw data, despite our reservations about its appro- 



priateness (Section 5.4), because we want to compare our results with those 
of the JS studies^ 

• If this is not within the bounds for a crash prediction, then subsequent lines 
show the next best fit that is (or might be). 

Variation in the values of the critical parameters f3 and uj sufficiently large 
to take them across their acceptable boundaries lead to only quite small fluc- 
tuations in the RMSE. This can bee seen, for example, for the crashes of 1973 
and 1980 (see Table [3). 

We were interested in comparing our LPPL fits to those found in the JS 
studies. However, given the high sensitivity of the RMSE to small changes in 



the value of uj (see Section 5.5) and as the values for j3 and uj were reported 
to only one decimal place in the JS studies, our re-calculated RMSEs will be 
different from those that were obtained in these studies. We can see this in the 
bubble ending in the crash of 1997, where we have not only our recalculated 
RMSE using the parameters rounded to one decimal place, but also the RMSE 
using the unrounded parameter values as found by Johansen et al. JT]; the 
latter fit is considerably better than our recalculation (RMSE=436 rather than 
532 Hang Seng Index units) . This improvement is almost certainly due to using 
the exact rather than the rounded value of w. So caution needs to be taken 
when comparing the RMSEs for the fits reported in the JS studies and our fits. 

Of the eight pre-crash bubbles fitted in the JS studies we find virtually the 
same parameters for the LPPL for six of them; namely, those preceding the 
crashes of 1971, 1978, 1987, 1989, 1994 and 1997. However, for their other two 
bubbles we found different parameters as follows: 

1973: For this bubble, Sornette and Johansen [24] report the fit to the log of 
the Hang Seng index, rather than to the raw index. We have used both the 
log and the raw index. When we fit the log of the index we find a better 
fit than that reported in [53] with values of both (3 and uj outside their 
acceptable ranges. For comparison with other bubbles we also fitted the 
raw index; we find that the best fitting LPPL has a value for f3 = 0.26, 
which is within the acceptable range of 0.15 - 0.51, but for uj = 1.45, 
which is well below the lower bound of its critical range of 4.8 - 8.0 (see 
Equation [TJ . 



9 For the crash of 1973 Johansen and Sornette |12| used the log instead of the raw index, 
so we report both log and raw fits specifically for that year. 
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1980: We were able to reproduce the fit reported in [53], with a crash predicted 
3 days later, but it was not the best fit that we found. Our best fit 
predicted a crash after 52 days, and had critical parameter values uj = 7.51, 
which is acceptable, but j3 = 0.01, which is outside the acceptable range. 

There are three pre-crash bubbles that were not considered in the JS studies; 



one, in 1981, they did not consider a crash (but see Section 5.2), and two others 
were later than their period: 

1981: We find a best fit for which both /?(= 2.41) and lu(= 3.02) are well outside 
their acceptable ranges. As j3 > 1, we surmise that this fit would have 
been rejected by the criteria used in the JS studies (see Section El). The 
first fit that has a (3 <— 1 has to — 4.75, which is just acceptable, but with 
a (3 = 1, i.e. no power law, so well outside its acceptable range. It might be 
argued that this peak was too soon (8 months) after the trough following 
the previous crash of 1980 for an LPPL to be fitted on the grounds of 
there being insufficient data. But, as we have argued in Section [572] we 
believe it should have been. 

2000: Our best fit to the bubble has both critical parameters /3(— 1.00) and 
lu{= 18.35) well outside their respective acceptable ranges. There is a 
fit that does have these parameters within their acceptable ranges, and 
predicts a crash after only 9 days; but it is not the best fit. 

2007: Our best fit to this bubble has parameters well within the ranges required 
for a crash and the crash is predicted for the day it actually occurred. 



6 Conclusion 

The LPPL for pre-crash bubbles on stock markets, as reported in Johansen et 
al. [IT] and the JS studies, has important consequences. Our analysis has led 
us to the following conclusions. 

The mechanism proposed to lead to the LPPL fluctuations as reported in 
Johansen et al. |llj must be incorrect as it requires the price to be increasing 
throughout the bubble (a constraint recognized later in |26j). but in half the 
studies reported the LPPL fitted to the index (or its log) decreases at some 
point during the bubble. Hence either another explanation is required or the 
fits have to be redone with a constraint on the parameters that leads to LPPL 
fits that never decrease. Also, in the JS studies the fits were made to the raw 
rather than the log of the index for all but one (1973) of the eight bubbles, 
even though the assumption upon which the use of the raw rather than the log 
should be used was certainly not met in six of these seven bubbles. So, on both 
counts, these studies should no longer be used to support a conclusion that the 
proposed mechanism underlies the LPPL. 

Identifying crashes and bubble beginnings was not well specified in the JS 
studies. In particular it is not clear why one peak, that of 1981, was not identified 
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as a crash initiator. Moreover, moving the trough that marks the beginning of 
a bubble forward by 'eye' in half the data sets is not really satisfactory. While 
we have taken more care in identifying those peaks that initiated crashes, we 
have still, for comparison, used the same bubble beginnings as used in the JS 
studies; in future a clear criterion needs to be established. 

In the JS studies, the fits of the LPPL to the data were only accepted if 
the exponential parameter ft was < 1. That is, the fits showed an exponential 
increase. It would be stronger to reject the LPPL if a /? > 1 is found. 

In our study the two critical parameters of the fitted LPPLs, (5 and uj, do 
fall within acceptable ranges in 7 of the 11 bubbles. Of the remaining four 
bubbles, an LPPL with critical parameters within their respective acceptable 
ranges could be found for all but one crash (1973). However, these LPPLs did 
not have the best fits (minimum RMSE). For one crash (1980) the best fit would 
be acceptable if the lower end of the acceptable range of /3 was decreased, i.e. a 
range of 0.01 - 0.51. For another (1981), a fit with (3 > 1 would also have to be 
ruled out to save the hypothesis. For two crashes (1973 and 2000), there seems 
to be no saving strategy. That the bubbles leading to the 1981 and 2000 crashes 
do not satisfy the criteria is particularly negative as these are two of the three 
crashes for which the ranges on the critical parameters were not set post hoc in 
the JS studies. 

Finally, while the objection that with seven parameters a curve can be fitted 
to any data [TS] is not directly relevant as no goodness of fit is measured here, 
it is indirectly highly relevant. The RMSE of the fit of the LPPL model (Eq. 
[T]) to the data is highly sensitive to small but not to large fluctuations in one of 
the critical parameters (u>); this makes the search for the LPPL that minimizes 
the RMSE unreliable. Moreover, substantial fluctuations in both parameters 
together can result in quite small changes in the RMSE. This suggests that 
the permissible ranges for these parameters should not be independent of one 
another. 

Despite these criticisms, and because of the partial success of the hypoth- 
esis, in particular for predicting the 2007 crash, we believe that it is worth 
investigating whether fitted LPPLs with critical parameters in acceptable non- 
independent ranges can be used to give a probabilistic, rather than an all-or-none 
prediction of an impending crash. This will be the subject of on going work. 
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Appendix: Search algorithm 



0. For each of the four parameters (3, ui, t2c and (j), fix the lower L and upper 
U bounds for the seeds. For a subset V of selected parameters ((3 and w) , 
fix the minimum width W to continue searching. 

1. Choose as the current seed SI <— (L + U)/2, the mid point of the current 
lower and upper bounds. 

2. Run the unbounded Nelder-Mead Simplex search from the current seed 
SI, which will return a solution S2. 

3. Construct a hypercube in the space of V using SI and 5*2, with their 
minimum as the bottom corner: B 4— min(51,52); and their maxima as 
the top corner: T <— max(Sl, S2). 

4. For p <— 1 : size('P), i.e. for each of the selected parameters, do: 

if Bp — L p < W p i.e. if there is too little space under the hypercube on 
the p th dimension in V, set B p L p , i.e. set the bottom of the 
hypercube on the p th dimension to its lower bound, 
else recursively search from step 1, with L' L and U' U,U p 
Bp, i.e. search under the hypercube; 

if Up — T p < W p , i.e. if there is too little space above the hypercube on 
the p th parameter, set T p ^— U p , i.e. set the top of the hypercube on 
the p th parameter to its upper bound, 

else recursively search from step 1, with L' L,L' p <— T p and 
V <— U, i.e. search above the hypercube. 



Initial bounds on the four parameters for selecting seeds 







rads 


t2c 
days 




rads 


lower 








1 





upper 


2 


20 


260 


7T 


minimum width 


0.2 


2 
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